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Abstract 

Signs of hierarchy are prevalent in a wide range of systems in nature 
and society. One of the key problems is quantifying the importance of 
hierarchical organisation in the structure of the network representing 
the interactions or connections between the fundamental units of the 
studied system. Although a number of notable methods are already 
available, their vast majority is treating all directed acyclic graphs as 
already maximally hierarchical. Here we propose a hierarchy measure 
based on random walks on the network. The novelty of our approach 
is that directed trees corresponding to multi level pyramidal struc¬ 
tures obtain higher hierarchy scores compared to directed chains and 
directed stars. Furthermore, in the thermodynamic limit the hierarchy 
measure of regular trees is converging to a well defined limit depend¬ 
ing only on the branching number. When applied to real networks, our 
method is computationally very effective, as the result can be evaluated 
with arbitrary precision by subsequent multiplications of the transition 
matrix describing the random walk process. In addition, the tests on 
real world networks provided very intuitive results, e.g., the trophic 
levels obtained from our approach on a food web were highly consis¬ 
tent with former results from ecology. 
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Introduction 


Hierarchical organisation is an ubiquitous feature of a large variety of sys¬ 
tems studied in natural- and social sciences. Examples of empirical studies 
on hierarchy are including the transcriptional regulatory network of Es¬ 
cherichia coli [T], the dominant-subordinate hierarchy among crayfish [2], the 
leader-follower network of pigeon flocks [as], the rhesus macaque kingdoms 
[S], neural networks [U] and technological networks [7], social interactions 
[a El HQ], urban planning mm, ecological systems mm, and evolution 
mm- Naturally, hierarchy is a very relevant concept also in network the¬ 
ory o im US I19j . The network approach has become an ubiquitous tool 
for analysing complex systems ranging from the interactions within cells 
through transportation systems, the Internet and other technological net¬ 
works to economic networks, collaboration networks and the society [201121] . 

Grasping the signs of hierarchy in networks is a non-trivial task with 
a number of possible different approaches. On the one hand, we may try 
the statistical inference of an underlying hierarchy based on the observed 
network structure, as suggested in Ref. [18]. On the other hand, the intro¬ 
duction of a hierarchy measure is also a natural idea |22[ 123] \2^ l25l [26] . In 
general, a hierarchy measure, can be viewed as a function on the domain of 
graphs, H : G e-?■ M, mapping a graph Q & G into a real number, H{Q) G M. 
The value of the measure is actually H{Q) G [0,1] or H{Q) G [—1,1] in 
most cases, with high values corresponding to hierarchical structures and 
low values indicating the absence of hierarchy in the examined network. 

One of the first methods was proposed by D. Krackhardt, motivated 
by organisational hierarchy, defining the hierarchy measure simply as the 
number of ordered pairs divided by the number of connected pairs [22]. In 
the approach introduced by A. Trusina et ah, the position of the nodes in the 
hierarchy is assumed to be given by the degree, and the hierarchy measure 
is given by the fraction of directed shortest paths going strictly upwards 
in the hierarchy [23]. Another way for quantifying the possible asymmetry 
between nodes is to assume some sort of flow on the links, and examine 
whether the global map of flows in the system is revealing a kind of overall 
directionality or not. Probably the simplest approach in this framework is 
to define the fraction of links not participating in any cycle as the measure 
of the hierarchy, as suggested by J. Luo and C. L. Magee [24] . 

A further important property of a hierarchical system is that reaching 
the rest of the network should be relatively easy for the nodes high in the 
hierarchy, and more difficult for the nodes at the bottom of the hierarchy, 
as pointed out by E. Mones et al. in Ref. [25] . The hierarchy measure based 
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on this aspect is given by the Global Reaching Centrality, characterising the 
inhomogeneity of the fraction of reachable nodes in at most m-steps in the 
network [25]. A more elaborate quantification of hierarchy was proposed 
by B. Corominas-Murta et al. in Ref. |26j with the introduction of Treeness, 
Feedforwardness and Orderability, projecting the studied network onto a 
point in a 3 dimensional space, where each dimension is aimed to capture a 
different aspect of hierarchy. Treeness, T, is measuring how ambiguous are 
the chain of commands in the network, while Feedforwardness, F is related 
to the size and position of the strongly connected components in the network. 
Finally, the orderability, O is simply the fraction of nodes not taking part in 
any directed cycles, i.e., it is analogous to the hierarchy measure introduced 
by J. Luo and C. L. Magee. The 3d scatter plots of T, F and O provided 
very interesting results, revealing different clusters of hierarchical networks 
|26j . A more detailed description and comparison between the mentioned 
methods is given in the Supplementary Information SI. 

Although the methods listed above allow the examination of the hierar¬ 
chical organisation from different perspectives, a noteworthy common aspect 
of these approaches that they all treat acyclic networks as already maximally 
hierarchical, independent of the further details of the graph structure. (The 
Global Reaching Gentrality given in Ref. [25] is an exception, considering 
the star conhguration as the most hierarchical). Here we argue that differ¬ 
ent acyclic networks are not necessarily equally hierarchical. The general 
intuition of a hierarchy is usually corresponding to a multi level pyramidal 
structure, with levels becoming wider and wider as we descend from the 
root towards the bottom. On the one hand this way the top nodes in the 
hierarchy can reach most of the network in a very effective way, i.e., via 
paths of average length scaling as InAi, where N denotes the number of 
nodes. On the other hand, in this structure all nodes can have a treatable 
number of direct subordinates. In contrast, if we consider a directed chain, 
all the levels are of size one, and this is leading to a large average distance 
scaling as N. The other extreme limiting case of acyclic networks is given by 
the directed star conhguration, where all the nodes have a single incoming 
link from a central hub, and no further out-links. In this case the hierar¬ 
chy is consisting of only two levels, and the supposed leader in the network 
has to cope with a number of direct descendants scaling as N. Based on 
that, introducing a hierarchy measure preferring trees to chains and stars 
would be a substantial step towards achieving a more intuitive approach for 
evaluating the importance of hierarchy in a network structure. 

In this paper we tackle this problem with the help of random walks on 
the network. Random walks provide a fundamental model for stochastic 
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processes in a large variety of systems ranging from physics m, chemistry 
[28] and computer science [29] through biology and ecology [Ml EH to eco¬ 
nomics [32] and psychology [33] • In the current problem of quantifying the 
extent of hierarchy in a network structure, random walkers can be used to 
evaluate the rank of the nodes in the hierarchy. 

The basic idea is assuming an information flow on the links from nodes 
high in the hierarchy towards the lower levels, in a similar fashion as in 
case of a company, where the management is likely to send information and 
instructions to the employees on a regular basis. Given the network struc¬ 
ture, the source of information in the system can be traced back by sending 
random walkers traversing the links in reverse direction from all nodes. In 
case the density of the random walkers is reaching a steady state, its value 
at a given node can be interpreted as the probability that the node was 
the source of information. Consequently, high random walker density values 
indicate a high standing in the hierarchy, whereas low density values are cor¬ 
responding to bottom nodes. The significance of hierarchical organisation in 
the network structure can be judged based on the inhomogeneity of the dis¬ 
tribution: In a homogeneous distribution we cannot pinpoint the source of 
information, thus, it is corresponding to a non-hierarchical network. In con¬ 
trast, a very inhomogeneous distribution is indicating a strongly hierarchical 
structure. 


Random walk hierarchy measure 

The details of the random walk process are the following. Since the random 
walkers are traversing the links backwards, the transition probability for a 
walker from node j to i is proportional to the inverse of the in-degree of j, 
i.e., P{j —>■ f) oc IjkJ^. Another important factor to be taken into account is 
the limited capacity of the information sources for sending information: In 
general we can assume that the more out-neighbours a given node has, the 
less resource it can allocate for managing the communication over a given 
link. This effect can be taken into account by assuming that P{j —>■ i) is 
also proportional to the out-degree of i, i.e., P{j —>■ f) oc Combining 

the above factors together is resulting in 

P{j -^i) = (1) 

for the transition probability of the random walkers from node j to i. (In 
case i is not an in-neighbour of j the transition probability P{j —>■ i) is zero 
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by definition). We note that due to the second factor on the right hand 
side of ([T|), the probability for staying at the same node can be non-zero in 
general, given by P{j —)• j) = 1 — 0- For weighted networks 

© can be naturally generalised to 


where Wij denotes the weight of the link from i to j. 

In case of acyclic networks, all random walkers eventually converge into 
nodes with no incoming links, (i.e., the “sources” in the network). In order 
to avoid judging the importance of hierarchical organisation in the system 
solely based on these “sources”, we inject new random walkers into the 
network at every time step. The update rules are the following; 



1. We insert / random walkers into the system, increasing the random 
walker density on every node hy f /N, thus, the random walker density 
at node i given by Pi{t) is changing as 

Pi{t) + ^. (3) 


2. We let all random walkers in the system proceed on step, governed by 
the transition probabilities given in ([T]). By introducing a transition 
matrix T with matrix elements Tij = P{j —?■ i), the density of random 
walkers on node i after the transition can be expressed as 

N 

Pi{t+1) = (4) 

i=i 


3. The total sum of random walkers has to be normalised, i.e., we require 
Pi{t + ^) = 1- Since the sum of new random walkers added to the 
system was /, we have to simply divide the density of random walkers 
by 1 -|- / in order to fulfil the normalisation condition, 

Pi{t + 1)^ . (5) 


The above normalisation of the random walker density to unity after 
each iteration is equivalent to using “decaying” random walkers, having a 
weight decreasing by a factor of (1 + f)~^ in each step. Let us denote 
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the characteristic distance under which the weight of a random walker is 
decreased to by A, fulfilling 

(l + /)-" = e-^ (6) 

Based on that, / can be also expressed as 

/ = - 1. (7) 


Although A, (or equivalently, /) is a parameter of the method at the current 
stage, later on in the Results we shall find a natnral condition for fixing A 
at an optimal valne in general. 

Our main object of interest is the stationary distribution of the random 
walkers in the network. By writing this distribution in a vector form of 
pStat^ where the i-th component of the vector, is corresponding to the 

random walker density on node i, we can derive a simple equation based 
on the update rules. Adding f/N new random walkers at each node is 
simply incrementing each vector component by f/N, while the transition 
to the next site by the random walkers corresponds to multiplying by the 
transition matrix T. Finally, the normalisation of the random walker density 
simply multiplies each vector component by 1/(1 + /). Based on the above 
the stationary distribution fulfils 


P 


stat 


1 

T+7 


T 



= e-i/^ 


T 



gi/A _ I 

N 



( 8 ) 


where 1 is corresponding to a vector of size N with all elements equal to 1. 
By expressing we obtain 


f 1 — 1 

,stat = L[(1 + f)l - T]-' T1 = 


jiAj. T 


1 -1 


Tl, 


(9) 


where I is denoting the identity matrix. Since T is a left stochastic matrix, 
the absolute value of its largest eigenvalue is 1. Consequently, the absolute 
value of the eigenvalues of are smaller than 1, and therefore, ([9]) can 

also be written as 
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stat 



giA _ 1 


N 


f;(e-AT)”i. 

n=l 


( 10 ) 


This formula is very intuitive, showing explicitly that the stationary 
distribution of random walkers at a given node is given by the sum of the 
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probabilities of all the paths ending on the node, where the contributions 
from longer paths are suppressed exponentially as a function of the path 
length. Based on (mo]), can be computed very efficiently. If the 

size of the network is moderate, we can use ([9|) for obtaining exact results. 
However, if matrix inversion is becoming computationally expensive, a very 
good approximation of can be calculated according to ([TO]) , he., by 
carrying out the summation up to a certain finite limit Umax, the obtained 
result is converging to the exact exponentially fast. 

Our hierarchy measure is based on the inhomogeneity of the stationary 
distribution of the random walkers. There are several different possibilities 
for quantifying the inhomogeneity of a probability distribution in general, 
here we choose the relative standard deviation, (also called as the coefficient 
of variation). Thus, the random walk hierarchy measure is defined as 

H = fill 

^(pstat)’ 1 ) 

where and cr(p®*®'*) denote the mean and the standard deviation 

of pf®'* respectively. Since — 1) ^he mean is given simply by 

/i(p®*at) = 1/A^, and our hierarchy measure can be expressed as 


H = N 
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2=1 


Results 


Hierarchy of acyclic networks 


For demonstrating the sensitivity of our measure to the topology also in 
case of acyclic networks, first we evaluate H for chains, regular trees with a 
constant branching number b, and stars. According to calculations detailed 
in Methods, the corresponding hierarchy values can be expressed as 


H. 


chain 


f^tree 


-f^star 


\ 


2 ei3-2N)/\ (g7V/A _ (giV/A _ gl/A) 
N (gl/A_l)2(gl/A + i) 

^max 

iV^6«-i(pf'^t)2 - 1, 

i=i 


^{N - l)e2/^ 
(A^-1) (eVA_i) +1’ 


(13) 
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where N is the number of nodes in the networks, I denotes the levels in case 
of the tree (starting from Z = 1 at the root and ending with /max at the leafs), 
and is corresponding to the stationary distribution of the walkers on 
level I in (|14p . which can be obtained from a simple recursion written as 


Pi 


,stat 

max 


^stat 


Pi 


stat 

Pi 


— 1 


h-i 


N 1 + — 6’ 

^ „stat , - 1 

1 + 6eVA _ 


26-1 


N 1 + 6e^/'^ — 6 


stat 2 


(16) 

(17) 

(18) 


In FigHJ we compare the hierarchy measures given in (|13II15|) at A = 2 
(Figll^) and at A = 4 (FiglDa). Our construction algorithm for the trees 
with a branching number 6 was to start by adding 6 links to the root, then 
move to the second level and subsequently add 6 links to every node on 
this level, and so on, move to the next level only when the given level was 
completed. Whenever the number of nodes in the tree has reached the 
algorithm terminates, and naturally, the resulting tree is not completely 
regular in most of the cases. Nevertheless, the overall structure of the trees 
obtained in this way is getting closer and closer to regular trees as N is 
increasing. 

According to Fig{T] the H for the chain and the star configurations has 
a peak at very small system sizes, and shows a decreasing tendency for 
growing N. In contrast, for regular trees H seems more or less converging 
to a finite value. Thus, above a certain N it is the structure of the tree, 
(encoded in the branching number), what determines the hierarchy measure, 
not the size of the tree. This is indicating that H is behaving similarly to 
intensive quantities in physics in some aspects. The “intensive” property 
of the hierarchy measure is analysed in more details in the Supplementary 
Information S2, here we note that if we take a pair of graphs Qi and Q 2 
which are not connected to each other, then H for the union of the graphs 
is equal to the weighted quadratic mean of the H values calculated for the 
graphs separately. 




' N^[Hg,f + N2[Hg,f 


N1+N2 


(19) 


Thus, in the special case of a pair of isomorphic graphs Hg^yjg^ = Hg^ = Hg^. 

We continue with the examination of the behaviour of H in the ther¬ 
modynamic limit. According to calculations detailed in Methods, when the 
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Figure 1: Comparing the random walk hierarchy for chains, regular trees 
and stars, a) The behaviour of as a function of N for a chain (black), a 
regular tree with branching number b = 3 (red), a regular tree with 6 = 5 
(green), a regular tree with 6 = 7 (blue) and a star (orange) at A = 2. b) 
The same plot as in a) when A is set to A = 4. 

system size is diverging, N ^ oo, the hierarchies given in ()13II15I) take the 
form of 


-^chain OC 

7V-V2^ 


(20) 


[ / (6-l)e2/> 
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Htjcee — 
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(21) 


oo 
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JV-V2_ 


(22) 


Thus, the hierarchy measure is vanishing for a chain and a star in the ther¬ 
modynamic limit. In contrast, TItree is converging to a well defined finite 
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limit value or 6 < 1 < oo when A is smaller than a b dependent critical 
value, and is diverging otherwise. In the Methods we show that the critical 
A value is given by 



Figure 2: 2d plot of the random walk hierarchy measure H for infinitely large 
regular trees as a function of the branching number b and the parameter A. 
The formula for H is given in (j21ll . At 6 = 1 we recover the infinitely large 
chain, while the infinitely large star is corresponding to the 6 —?■ oo limit. 
The dashed line is showing the maximum place of H. 

The behaviour of the limiting i^tree given in (1211) is shown in a 2d plot 
in Figl2j as a function of b and A. At 6 = 1 the tree becomes equivalent to 
an infinitely large chain, and according to (j20p the H becomes zero. The 
2d surface displayed in Figl2j is consistent with this result, as it starts from 
m = 0 at 6 = 1 for all A values. Similarly, the H for an infinitely large star 
is also zero according to (1221) . The surface shown in Figl2j is consistent with 
this result as well, as we can see a decreasing tendency in // as a function of 
b in the large b regime. In the range of intermediate branching numbers we 
can observe an A dependent maximum in H. This behaviour is examined in 
more details in the Supplementary Information S3. 

Based on the behaviour of H in the thermodynamic limit, we can also 
fix the A parameter at an optimal value in general as follows. Since A is 
corresponding to the characteristic path length a random walker can traverse 
before “decaying”, on the one hand we would like to choose a A as high as 
possible. I.e., if A is small, the random walkers can explore only within a 
very limited range from their origin, thus, the information we can retrieve 
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via the random walkers is also very local. However, due to its self similar 
nature, hierarchical organisation can manifest on all length scales, therefore, 
we need random walkers travelling longer distances in order to be able to 
tell apart hierarchical and non-hierarchical networks. 

On the other hand, if A is too large, we may run into diverging hierarchy 
values according to (|21l) . which needs to be avoided in case of a well behaving 
hierarchy measure. Therefore, we fix A at a value as high as possible where a 
diverging H is avoided for sure even in case of infinitely large regular trees. 
According to (I23p . the minimum of Xc(b) can be found at 6 = 4, where 
Ac (6 = 4) ~ 4.48. Since the path length traversed by a random walker 
is increasing by unity under every iteration, it is also natural to set A to 
an integer value. Based on the above, the optimal setting for A is given 
by A = 4. In the rest of the paper we are assuming that A is set to this 
optimal value, and thereby consider our approach a parameter free method 
for measuring the amount of hierarchical organisation in the structure of 
networks. 

Since real world hierarchies are usually not as highly ordered as a regular 
tree with a constant branching number, we extended our comparison stud¬ 
ies of acyclic graphs also to general directed trees. By applying a simple 
algorithm detailed in Methods, we generated a large family of trees with 
branching numbers varying around a given average branching number (b) 
according to a shifted Poisson distribution. In FiglSl we show the average 
of the random walk hierarchy measure, (H) as a function of (b), calculated 
based on 100 realisations of trees consisting of N = 1000 nodes. According 
to the curve, the maximum of (H) is at an intermediate average branching 
number, where the structure of the network is really tree like, he., for low 
average branching numbers, (where the structure is basically a chain), and 
also for very large branching numbers comparable to the system size, (where 
the structure is close to a star), the obtained (H) values are considerably 
lower. 

Results on real networks 
St. Marks food web 

Here we apply our method for analysing the hierarchy of the St. Marks 
food web [31!, representing a part of the ecosystem of Goose Creek Bay, St. 
Marks National Wildlife Refuge, Florida, USA. The nodes of the network are 
corresponding to living compartments, (group of species) based on probable 
diet and life history characteristics. Thus, compartments range from single 
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Figure 3: The average random walk hierarchy (H) as a function of the 
average branching number (b) for general trees of = 1000 nodes, averaged 
over 100 instances. When (6) is close to one, the tree is basically a chain, 
whereas at very large branching number, its structure is close to a star. 


species (e.g., pinhsh) through a couple of species (e.g., gulf flounder and 
needlehsh) to large groups of taxa, (e.g., bacterioplankton). The links be¬ 
tween the nodes represent the feeding pathways, pointing from consnmers to 
their food sources, where the link weights are corresponding to the fractions 
of the consumer’s diet. 

The static distribution of the random walkers on the network dehned 
above can be calcnlated nsing ([9]). However, is dehning only a ranking 
between the nodes and does not provide the hierarchy levels in the first 
place. Therefore, we sampled and aggregated nodes into levels so that in 
each level, the standard deviation of is lower than a pre-defined fraction 
of the standard deviation in the whole network. (This type of procedure for 
obtaining the hierarchy levels was established in [25]). 

In Figj4j we show the resulting hierarchy between the compartments 
when the standard deviation of pf^^ within the levels is at most 0.125 • 
iT(pStat). Xhe hierarchy levels are consistent with the common sense about 
food webs as e.g., benthic algae is on the lowest level, herbivorous ducks 
are somewhere in the middle, and raptors (such as e.g., the bald eagle) are 
on the top of the hierarchy. The colour coding of the nodes is showing the 
effective trophic level of the compartments given in |34] . ranging between 
1.0 and 4.32. Apparently, the position of the nodes in the hierarchy and 
their colonr are coherent in most of the cases, e.g., the root has the highest 
effective trophic level, and the nodes with the lowest trophic level are at 
the bottom of the hierarchy. However, a small number of discrepancies can 
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Effective 

trophic 

levei 



S. hake & sea robins: 
F. & c.-eat. birds: 
B.-eat. birds: 
Susp.-feed. moiiucs: 
A. s. & b. anchov.: 

S. minnow: 

Predatory gastrop.: 
E.-graz. gastrop.; 
Other gastrop.: 
Predatory poiych.; 


Southern hake & sea robins 
Fish & crustacean-eating birds 
Benthos-eating birds 
Suspension-feeding moiiucs 
Atlantic silverside and bay anchovies 
Sheepshead minnow 
Predatory gastropods 
Epiphyte-grazing gastropods 
Other gastropods 
Predatory polychaetes 


Susp.-f. poiych.; 
Fish-eat. birds: 
Herbivor. ducks; 
Omnivor, crabs; 
Herbivor. shrimps: 
Gulf. f. & needlefish; 
Dep.-feed. poiych.; 
E.-graz. amphip.: 
Dep.-feed. p. crustac.: 
Dep.-feed. gastrop.; 


Suspension-feeding polychaetes 
Fish-eating birds 
Herbivorous ducks 
Omnivorous crabs 
Herbivorous shrimps 
Gulf flounder & needlefish 
Deposit-feeding polychates 
Epiphyte-grazing amphipods 
Deposit-feeding peracaridan crustacean: 
Deposit-feeding gastropods 


Figure 4: Hierarchy of the St. Marks food web. a) The nodes are ordered 
according to the stationary distribution of the random walkers calculated 
from Q, and the hierarchy levels are corresponding to groups of nodes for 
which the standard deviation of is at most 0.125-it(p®*®'*), where 
denotes the standard deviation of pf‘^^ over the whole network. The colour 
coding of the nodes reflects their effective trophic level published in |34] . b) 
Listing of the abbreviations used in a). 


be also observed, (e.g., as in case of Gulf flounder &: needlefish), signing 
that the effective trophic levels and the random walk based hierarchy are 
catching slightly different aspects of the studied food web. 

Finally, the Spearman’s rank correlation coefficient between the ranking 
of the compartments according to pf'^^ and the ranking according to the 
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effective trophic levels is 0.593. In contrast, the Spearman’s rank correlation 
coefficient between the effective trophic levels and the hierarchy obtained 
after applying a degree preserving link randomisation to the network is only 
0.006 ± 0.138. Based on the above, our hierarchy is highly consistent with 
former results from ecology. 


Comparing different networks 

We also calculated the H given in (|12p for numerous different systems rang¬ 
ing from metabolic and regulatory networks through citation, trust and 
language networks to the Internet and the WWW. (A detailed description 
of the networks is given in the Supplementary Information S4). In FigjSj 
we show the obtained hierarchy values as a function of the network size, 
N. According to the figure food webs, electric circuits and regulatory net- 
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Figure 5: Random walk hierarchy of different real networks. Each symbol 
is corresponding to a different network, where the shape and colour of the 
symbols is encoding the type of the system. The horizontal coordinate of 
the symbols is corresponding to the size of the corresponding network, while 
the vertical coordinate is giving H. 

works provide the largest H values, and in contrast, the informal networks 
of acquaintances in different organisations seem the least hierarchical. In 
the mean time, the WWW, the Internet, the citation-, metabolic-, trust- 
and language networks appear to be moderately hierarchical. 

However, under certain circumstances we may obtain a moderate hier¬ 
archy measure even in a random graph. E.g., the structure of the giant 
component in the Erdos-Renyi graph [35] is more or less tree-like if we are 
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close to the percolation threshold, and tree-like structures are usually con¬ 
sidered highly hierarchical. Accordingly, in order to make a fair judgement 
on the importance of hierarchy in the topology of a real network, we need to 
compare the measured H to the result we expect in a suitably chosen ran¬ 
dom network ensemble, modelling the structure of the given network under 
the assumption of random connections. In order to take into account of the 
degree distribution of the studied networks, we use the configuration model 
for evaluating the expected value of H in the random network ensemble. A 
sample from this ensemble can be obtained by simply link randomising the 
given real network, keeping the degree of the nodes fixed under the random 
rewiring of the connections. 

The difference between H obtained for the real networks and the ex¬ 
pected value of H in their random counterparts can be measured in terms 
of the z-score, defined as 


-{H) 

(H) 


where (H) and (t{H) denote the expected value- and the standard deviation 
of H in the random ensemble, respectively. Thus, we basically scale the dif¬ 
ference between the real H and the average of H over the random ensemble 
by the standard deviation of H in the random ensemble. 

In FigiH we show the z-scores corresponding to the H values displayed 
in FiglSJ According to the results, the citation networks and the network 
between the web pages of the nd.edu domain achieve outstandingly high z- 
scores. Furthermore, all of the food webs, the Internet networks and also the 
rest of the WWW networks obtain considerably large positive z-scores. This 
means that the structure of these networks is far more hierarchical compared 
to a random network with the same degree distribution. In contrast, all 
of the regulatory- and metabolic networks have negative z-scores, (with 
rather large absolute values in the latter case). Thus, these networks are 
less hierarchical compared to what we would expect on a random base. 
Finally, in case of the electric-, organisational-, and language networks we 
see a mixed picture, where both positive and negative z-scores occur. Most 
of the organisational networks have positive z-scores, reaching to a quite high 
value in case Consulting network, while in parallel we obtain a negative z- 
score for the Enron network. The word adjacency network for the French, 
Spanish and Japanese languages have negative z-scores, opposed to a clearly 
positive z-score in case of the English language. A more detailed analysis of 
these results is provided in the Supplementary Information S4. 
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Figure 6: The z-score of H for the networks shown in FiglSl The z-score, 
given by z = {H — {H))/a{H) is plotted as a function of the system size 
N. The random null-model for evaluating (H) and (j{H) is corresponding 
to the configuration model. 


Discussion 

Measuring the significance of hierarchical organisation in the structure of a 
complex network is a non-trivial problem with a number of different options 
available. Here we have proposed a novel method based on random walks 
on the network. The basic idea behind our approach is that if nodes were 
sending instructions or information over the links to their subordinates, 
then the sources of the information could be traced using random walkers 
traversing the links backwards. The update rules of the dynamics are chosen 
in a way to make the density of the walkers on the nodes converge to a 
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stationary distribution exponentially fast with the number of iterations. The 
position of the nodes in the hierarchy is determined by this distribution, 
with high random walker densities corresponding to top nodes, and low 
values of the distribution signalling bottom nodes. The overall measure of 
the hierarchy is given by the inhomogeneity of the stationary distribution. 

The calculation of the hierarchy measure can be carried out based on re¬ 
peated multiplications of an N hy N transition matrix, making the method 
computationally very efficient and opens up the possibility for GPU based 
parallelisation. The other main advantage of our approach is that it can 
differentiate between directed acyclic graphs of distinct nature, opposed to 
most other methods treating all directed acyclic networks as already max¬ 
imally hierarchical. The random walk hierarchy measure provides higher 
scores for trees showing a multilevel pyramidal structure compared to chains 
and stars. This is consistent with a general intuitive picture about hierar¬ 
chies: the multilevel pyramidal structure enables the leaders in the tip of 
the hierarchy to reach the rest of the system via relatively short paths, and 
also avoids the “overloading” of any nodes with a too large number of direct 
subordinates. In contrast, the distance between the top and the bottom 
becomes very large in a chain, while the number of direct subordinates for 
the central node is diverging with the system size in case of a star. A fur¬ 
ther interesting property of our measure is that it is behaving similarly to 
intensive quantities in physics. I.e., for regular trees with a constant branch¬ 
ing number the hierarchy measure is converging to a well defined value in 
the thermodynamic limit. Thus, above a certain scale it is the structure, 
(encoded in the branching number), what determines the hierarchy, not the 
size of the network. 

Moreover, our tests on real world networks provided rather encouraging 
results. On the one hand, the detailed analysis of the St. Marks food web 
resulted in hierarchy levels that are highly consistent with former results 
from ecology on the effective trophic levels in the system. On the other 
hand, the large scale analysis of numerous further real networks revealed 
that the value of the hierarchy measure on its own does not always provide 
a fair characterisation of the importance of hierarchy in the structure of 
the studied system. According to our results, in some cases a relatively 
low H value can be accompanied by an outstandingly high z-score, when 
we compare the actual H to the expected value of in a randomly rewired 
network with the same degree distribution. This leads to the conclusion that 
the basic network characteristics such as the link density, degree distribution, 
etc. can inflict some constrains on the possible range of H and also on (H) in 
the corresponding random network ensemble. However, the further analysis 
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of these effects is out of the scope of the present paper and is providing 
interesting directions for further research. 


Methods 


Hierarchy of chains, regular trees and stars 

First we note that H = {) when the network is consisting of only a single 
directed cycle. Since all the nodes are equivalent in this case, = 1/N 
for all i, thus, = 0. Now let us examine how does H change if we 

move from a cycle to a chain by cutting a single link. Since all the nodes 
have a unit in-degree except the hrst node, and all the nodes have a unit 
out-degree except the last node, the transition probability from level I to 
level I — 1 is unity. Therefore, the stationary distribution on the last node 
is zero, = 0, (since the total amount of injected random walkers exit 
immediately, and there is no inflow of walkers from outside). Based on ([8]), 
the stationary distribution of the walkers on the intermediate levels fulfils 

pf“ = ^ (p!S + ^) = . (25) 


while in case of the first node the injected random walkers cannot exit the 
node, resulting in 
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By substituting into (fl^ the result simplifies to (fTBD . 

The random walk hierarchy measure for general trees with varying branch¬ 
ing number cannot be given in a general formula, nevertheless it can be cal¬ 
culated exactly for any particular finite tree based on Q and (fl^ . However, 
in case of a regular tree with branching number b, a simple recursion can be 
given for the stationary distribution of the random walkers, as the transition 
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probability from any node to its “leader” in the level above is simply 1/6. 
The random walkers cannot exit from the root, which we label as level I = 1, 
and there is an inflow of walkers from the second level, resulting in 


_stat 

Pi 


1 + / 


= e-i/A 




stat 
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pi/-'' _ I 
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which is exactly the same as in case of the first node in the chain, given 
in ()26ll . For the intermediate levels, we have an inflow of walkers from the 
level below, and also a term corresponding to the probability of the walkers 
staying at the given level instead of moving to the level above, yielding 
altogether 
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Finally, on the last level I = /max we have no inflow from other nodes, giving 


Pi 


stat 
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(31) 


which provides an immediate solution for in the form given in (I16p . 
Based on (flGl) we can calculate the stationary distribution on the rest of the 
levels as well, i.e., by rearranging (f2^ and (fHO]) we gain (flTl) and (fTSl) . 

The hierarchy measure for the star can be evaluated in a similar fashion 
to that of the chain. In this case N — 1 peripheral nodes are connected 
to a central node, from which the random walkers cannot exit. Thus, the 
stationary distribution of the random walkers fulfil 
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where denotes the density on the central node, and is equal to the 
density on the peripheral nodes. From (|33l) we can express directly as 


stat _ / IV - 2 _ - 1 N-2 

~ N f{N -1) + 1~ N (ei/A_i)(iV-l) + l’ 
and by substituting (fM|) into (l32|) we arrive to 
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(35) 


for the central node. According to (| 12 p . the random walk hierarchy measure 
can be given in this case as 


i^star = ^JN [(p«t'^t)2 + {N- l)(j>«tat) 2 ] _ l, ( 35 ) 

By substituting ([34|) and (1351) into (l36]) . the resulting formula can be sim¬ 
plified to ([15]). 


Hierarchy in the thermodynamic limit 


Taking the N ^ oo limit of iifchain given in ()13[) and of i^star written in m 
is trivial, the results are given in (|20p and in (1221) respectively. In contrast, 
the evaluating the N ^ oo limit of Htree given in ()14p is more complicated 
and can be carried out as follows. 

First we separate the first term from the rest in the sum over the levels 
in (ITTp as 
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In order to evaluate the remaining sum, we express pf^^^ given in (1171) as 
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Based on the above 
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By using that N = ^ ps if Ai >> 1, (HOll can be also written as 
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According to (1391) 
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or in terms of A if and and only 
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Thus, when (1441) . or equivalently (14511 are fulfilled, the last two terms in 
(l¥T]l vanish if oo. By using (15^ and neglecting the O terms we 

obtain 
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Now let us examine the (pits '*)2 (I37p . According to (1291) we can 

write 
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Since /max = C>(logA^) and f < 1, 
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and we obtain that 

^stat = O (1) . (50) 

As a consequence, N{ppp = O {^), which is also vanishing when N ^ oo. 
Hence, by substituting (1451) into (1571) and neglecting the O terms we 
arrive to 
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equivalent to the formula given in (|2ip . 
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Generating trees with a varying branching number 

We used the following algorithm for generating a tree with varying branching 
number between N nodes: 

(i) Initially the nodes are ordered, however, they are also completely iso¬ 
lated from each other 

(ii) We iterate over the nodes according to their order. For current node 
i we draw a number K{i) from a Poisson distribution with a fixed 
parameter a, and assign the branching number b{i) = K{i) -|- 1 to the 
node. (This way it is guaranteed that the branching number of i is 
h{i) > 1). 

(hi) We scan the nodes coming after i and stop at the first node j with no 
incoming link. We attach directed links pointing from i to the nodes 
starting from j and ending at j + h{i) — 1 . 

(iv) We repeat steps (ii)-(iii) until all nodes become connected to the tree. 

The advantage of this algorithm is that it enables the study of the swift 
transition from a chain through a family of trees with increasing average 
branching number to a star. I.e., if we set the parameter of the Poisson 
distribution to a = 0 , we obtain h{i) = 1 for all nodes, thus, the resulting 
graph is actually a chain. However, if a is large enough compared to N, the 
branching number drawn for the first node is already larger than N, thus, 
we obtain a star. For intermediate parameter values the average branching 
number of the tree is of course ( 6 ) = 1 + q;. However, the branching numbers 
of the individual nodes in the tree will deviate from this average in a similar 
manner to real systems. 
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Supplementary Information 


SI Measuring the level of hierarchical organisa¬ 
tion in a network 

Characterising the importance of hierarchy in a given network structure is a 
non-trivial problem with a large number of alternative approaches. Without 
loss of generality we can formulate a few intuitive requirements a hierarchy 
measure should meet: First, we assume no a priori ordering between the 
nodes, the measure is evaluated purely based on the topology of the network. 
Moreover, the hierarchy measure should should not be too sensitive to the 
local structure of the network, i.e., the degree sequence alone should not 
provide enough information for complete evaluation of H. In other words, 
we should be able to modify H when rewiring the links in a network without 
changing the degree sequence. E.g., let us suppose that the network is 
corresponding to a single directed cycle of N nodes, (a giant directed “ring”). 
Since all the nodes are equivalent, the network lacks any hierarchy what so 
ever, thus, H should be equal to 0. However, if we take just a single link 
away, (which is only a minor change if N is large), the network becomes a 
directed chain, which is indeed hierarchical, thus, a significant jump should 
be observed in H. 

One of the first hierarchy measure was proposed by D. Krackhardt, mo¬ 
tivated by organisational hierarchy [22]. The main assumption here is that 
in a hierarchical company we can reach the lower levels of the hierarchy from 
the levels above via chains of commands, and in contrast, we cannot reach 
the higher levels in a similar fashion from the levels below. Thus, on the 
one hand, we count the total number of ordered pairs (i,j) in the network 
for which there is directed path either from i to j or from j io i, but not 
both. On the other hand, we also evaluate the total number of connected 
pairs, for which at least one directed path exists between the two nodes, 
(but we also allow paths in both directions). By denoting the number of 
ordered pairs by A, and the number of connected pairs by C, Krackhardt’s 
hierarchy measure [22] is simply given by 

Hk = ^. (SI) 
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Based on the definition above, Hx = 1 for all acyclic networks, where 
all the node pairs are ordered. Nevertheless, Hx is a fine example for a 
hierarchy measure depending mostly on the global structure of the network: 
For a directed cycle of N nodes Hx = 0, (since we can reach any node 
from any other node, and hence, every pair is unordered). By deleting a 
single link we turn the network into a directed chain, (where all the pairs 
are ordered), and suddenly the hierarchy measure jumps to Hx = 1- 

An approach motivated by similar intuitions compared to Krackhard’s 
hierarchy was introduced by E. Mones et ah, assuming that reaching the rest 
of the network should be relatively easy for the nodes high in the hierarchy, 
and more difficult for the nodes at the bottom of the hierarchy [25] . Here the 
position of the node i in the hierarchy is determined by its reaching central- 
ity; C'r(*)) corresponding to the fraction of nodes that can be reached from 
i, (following directed paths). Based on the Cr, associated to the individual 
nodes, the Global Reaching Centrality of the whole network is defined as 


Eti [c^r^-c'R(i)] 


Hqkc = 


■»max 

1 


(S2) 


where the summation is running over the nodes, the size of the network is 
given by N and the maximal reaching centrality is denoted by 

The maximal possible value of this hierarchy measure is F^grc = 1; 
which is obtained when the network is corresponding to a star with N — 1 
arms, with only the central node having a non zero reaching centrality. 
Interestingly, for a chain of N nodes, we obtain Hqxc = 1/2 • N/{N — 1), 
which is still larger than 1/2. Similarly to the previous measure, FIgrc = 0 
for a directed cycle. 

Another way for quantifying the possible asymmetry between nodes is 
to assume some sort of flow on the links, and examine whether the global 
map of flows in the system is revealing a kind of overall directionality or not. 
Probably the simplest approach in this framework is to define the fraction of 
links not participating in any cycle as the measure of the hierarchy. I.e., the 
link flow hierarchy proposed by J. Luo and C. L. Magee can be formulated 
as [231 

^LF = (S3) 

where M^c denotes the number of acyclic links, not part of any directed 
cycle, and M is corresponding to the total number of links in the network. 

Similarly to Krackhardt’s hierarchy, the link flow hierarchy is = 1 
for all acyclic networks. Furthermore, when the network is corresponding 
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to a single directed cycle, -f/^LF = 0. Thus, when deleting a link from this 
cycle, we observe a jump from zero hierarchy to maximal hierarchy. 

A more elaborate quantification of hierarchy was proposed by B. Corominas- 
Murta et al. [26] with the introduction of treeness, feedforwardness and 
orderability, projecting the studied network onto a point in a 3 dimensional 
space, where each dimension is aimed to capture a different aspect of hier¬ 
archy. Treeness, T, is measuring how ambiguous are the chain of commands 
in the network. I.e., in a regular tree where links are pointing from higher 
levers to lower levels we obtain T = 1, whereas if revert the link directions, 
the obtained structure is considered anti-hierarchical, with T = — 1. The 
calculation of T is based on comparing forward and backward entropies [26j . 

In the mean time Feedforwardness, F is related to the size and position 
of the strongly connected components in the network. Since we can reach 
any node from any other node in a strongly connected component, we can¬ 
not define an ordering amongst these nodes. Furthermore, if the strongly 
connected component is found near the top of the overall hierarchy, its ef¬ 
fect is more severe compared to a situation where it is occurring only at the 
deeper levels. 

Finally, the orderability, O is simply the fraction of nodes not taking 
part in any directed cycles. 


O = A^ac/iV, (S4) 

where A^ac is the number of nodes that are not members in any directed 
cycles, and N is the total number of nodes. Thus, orderability is analogous 
to the link-flow hierarchy FIlf given in (jSSj), the only difference is that 
here we measure the weight of the cycles in the network by the number of 
contained nodes instead of the number of contained links. Similarly to FFlF) 
the oredrability is O = 1 for all acyclic networks, while 0 = 0 when the 
network is corresponding to a single directed cycle. 

S2 Intensiveness 

We have seen on Fig.l. in the main paper that H of finite regular trees 
is converging to the limit value for infinitely large trees already around 
N = 1000 when A = 2, while the convergence is somewhat slower when 
A = 4. Nevertheless, above a certain N it is the structure of the tree, 
(encoded in the branching number), what determines the hierarchy measure, 
not the size of the tree. This indicates that H is behaving similarly to 
intensive quantities in physics in some aspects. 


31 


To investigate this property further, let us examine what happens to H 
if we take a pair of disjoint graphs and Q 2 , and unite them into a single 
graph U Q 2 with two isolated components. According to Eq.(lO) in the 
main paper, the hierarchy measure of the graphs when considered separately 
can be written as 




Hg, 


\ 


Ni 




2=1 


\ 


N 2 


N2 *)2 - 1 , 

i=i 


(55) 

(56) 


where and correspond to the stationary distribution of the random 
walkers on nodes i G Gi and j G G 2 , while Ni and N 2 denote the sizes of 
Qi and Q 2 respectively. When considering the union of the two graphs, the 
stationary distribution of the random walkers has to be normalised over the 
whole range of A^i + N 2 nodes. Thus, the stationary distribution on node i, 
originally in Qi, now part of Qi DQ 2 , can be given as = pf^^Ni/{Ni + 
N 2 ). Likewise, the stationary distribution on node j, originally in Q 2 , now 
part of Qi U Q 2 , can be expressed as = p^j^^N 2 /{Ni + N 2 ). Therefore, 
the hierarchy measure of the union of the two graphs reads 


Afi N 2 

Hg,ug2 = . (iVi + iV2) I + 

i=l j=l 


- 1 = 


By rearranging 


\ 


A^i N 2 

^2^(pStat)2 + ^2 ^(pstat)2 

i=l j=l 


N 1 +N 2 


- 1 . 


we gam 


(S7) 


Ni 

E(pf“*)" = 

2=1 

[Hg,f + 1 

Ni ’ 

(S8) 

Ni 

= 

2=1 

[Hg,? + 1 

N 2 ’ 

(S9) 


and by substituting 


into (1571) we obtain 




'N,[Hg,f+N2[Hg,] 


N 1 +N 2 


(SIO) 
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Thus, the hierarchy measure of the union of two isolated networks is 
simply the weighted quadratic mean of H obtained for the individual graphs. 
A noteworthy consequence is that if Hg^ = Hg^, then this implies also that 
the hierarchy measure of the union is also the same, Hg^ug^ = Hg^ = Hg^. 
Furthermore, according to (jSlOD in general, if we take the union of Qi and 
Q 2 which are isolated from each other and Hg^ < Hg^, then the hierarchy 
measure of the resulting network will be between the hierarchies measured 
for the two graphs separately, 

Hg^ < Hg^yjg^ < Hg^. (Sll) 

Therefore, the rule for calculating H for a system composed of isolated 
smaller parts based on H obtained for these sub-systems is showing again a 
great deal of similarity to the behaviour of intensive physical quantities. 


S3 The trees of maximal hierarchy 


According to our results in the main paper, when considering regular trees 
of inhnitely large size, the hierarchy measure may either diverge or remain 
finite, depending on both the branching number b and the value of the A 
parameter. At a fixed b, the critical \c{b) separating the two regimes is given 
by Eq.(23) in the main paper. Here, in Fig ISll we show Ac as a function 
of 6 on a semi-logarithmic plot, where the regime of diverging H values is 
coloured grey. 

In the regime of finite H, the limit value for the hierarchy measure is 
given by Eq.(21) in the main paper. The maximum of this function can be 
located by solving 

- 1) - - 1) - 2 = 0. (S12) 


Based on that, at a hxed branching number 6, the A parameter providing 
the maximal H can be written as 


A* 


In 


62-26 



(S13) 


while at a hxed A parameter the tree with maximal hierarchy has a branching 
number of _ 


a/62/"^ — 1 

In Eig ISlI we also show A* given in (IS13I) as a function of 6. In parallel, 
the 6* expressed in (IS14h is plotted as a function of A in EiglS21 According 
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Figure SI: The “phase diagram” for H in case of regular trees in the thermo¬ 
dynamic limit. If A and b are falling into the region shown in grey, (meaning 
that A is larger than Xc(b) calculated from Eq.(23) in the main paper), then 
H is diverging. In contrast, for parameter settings falling in the white re¬ 
gion, H remains finite in the thermodynamic limit. The maximal H in this 
region is obtained at a A*(6), given in (|S13p . The dashed horizontal line is 
showing the minimum of Ac (6). 


to picture, the maximum of is at 5 = 2 when A is low. However, h* is 
steadily increasing as a function of A, reaching to b* ~ 3.84 at A = 4, chosen 
to be the optimal A value based on arguments given in the main paper. 

S4 Hierarchy of real networks 

We have analysed the hierarchy in a large number of networks of different 
types and varying sizes, the scatter plot of the obtained H values and the 
z-score of H are given in Figs.5-6. in the main paper. In Tables IS1IIS21 we 
provide the results in more details by listing also the size and the average 
degree beside the H values and z-scores of the examined networks. 

The electric networks, where the target of a directed link is depending on 
the value of the source node, turned out to be rather hierarchical according 
to our measure. Although their sizes is varying between N = O(IO^) and 
N = O(IO^), the hierarchy values are quite close to each other, forming an 
elongated cluster in Fig.5. in the main paper. The analysed two citation 
networks, where the links are pointing from the cited paper to the citing 
article, showed only a moderate amount of hierarchy. In contrast, a part of 
the food webs were amongst the most hierarchical networks in the studied 
examples. However, the H values showed a relative large variance for this 
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Figure S2; The branching number b* for which H is maximal, as a function 
of the parameter A in case of infinitely large regular trees. The vertical line is 
corresponding to the minimum of Xc{b) given in Eq.(23) in the main paper, 
thus, for A values above that H can become divergent in the thermodynamic 
limit. 

network type, which can be due to the different habitat of the species in¬ 
volved in the listed food webs. Furthermore, in case of the St. Marks food 
web the links were weighted, with the weights corresponding to the fractions 
of the consumers diet, whereas in the other cases the links were un-weighted. 

The peer to peer message networks over the Internet and the metabolic 
networks, (where the target of a link was corresponding to a product of the 
source) showed moderate levels of hierarchy. For both types, the data points 
in Fig.5. in the main paper formed tight clusters. The intra organisational 
networks were the least hierarchical according to our measure, with the 
data points forming a cluster close to the origin. Similarly to the food 
webs, the variance of the H values obtained for the regulatory networks was 
relatively high. I.e., the regulatory networks TRN-Yeast-2 and TRN-Yeast- 
1 are amongst the most hierarchical studied systems, whereas the TRN-EC 
network shows only a modest hierarchy value. 

The trust networks obtained moderate hierarchy values, and the variance 
of their H values is far smaller compared to the very large variance in their 
sizes. The language networks turned out to be moderately hierarchical as 
well. Here the networks were originating from large text corpora, and a 
directed link from word H to R is signaled that B was following A in the 
text. According to the results, the English- French- and Japanese language 
networks obtained H values quite close to each other, while the hierarchy 
of the Spanish language seemed considerably lower. Finally, the networks 
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Type 

Meaning of T —^ B 

Network 

N 


H 

Electric 

B depends on 

sl488 Eg 

667 

2.085 

0.893 


the value at A 

sl494 [36] 

661 

2.116 

0.880 



s5378 [36] 

2993 

1.467 

0.887 



s9234 [36] 

5844 

1.400 

0.870 



S35932 [36] 

17828 

1.683 

0.719 

Citation 

A is cited by B 

ArXiv-HepPh [37] 

34546 

12.203 

0.352 



ArXiv-HepTh [37] 

27770 

12.705 

0.430 

Food web 

A eats B 

GrassLand [38] 

88 

1.557 

1.551 



LittleRock [sg 

183 

13.628 

0.482 



St. Marks [10] 

49 

4.612 

0.634 



Ythan [38] 

135 

4.452 

0.677 

Internet 

A sent messages 

p2p-l [11] 

10876 

3.677 

0.524 


to B 

p2p-2 m 

8846 

3.599 

0.541 



p2p-3 [H] 

8717 

3.616 

0.529 

Metabolic 

B is an end 

C. elegans [ig 

1173 

2.442 

0.467 


product of A 

E. coli [42] 

2275 

2.533 

0.447 



S. cerevisiae [42] 

1511 

2.537 

0.440 

Organization 

B trusts in A 

Consulting [43] 

46 

19.109 

0.120 



Enron [44] 

156 

10.699 

0.126 



Manufacturing [43] 

34 

18.935 

2.18 X 10"^ 


B knows A 

Freemans-1 [4^ 

34 

18.971 

6.35 X 10"^ 



Freemans-2 [45] 

77 

24.412 

1.06 X 10"^ 

Regnlatory 

A regulates B 

TRN-Yeast-1 [ig 

4441 

2.899 

0.262 



TRN-Yeast-2 |47| 

688 

1.568 

0.720 



TRN-EC [47] 

419 

1.239 

0.823 

Trust 

B trusts in A 

College [ig 

32 

3.000 

0.464 



Epinions [49] 

75888 

6.705 

0.437 



Prison [Og 

67 

2.716 

0.565 



WikiVote [51] 

7115 

14.573 

0.201 

Language 

word B 

English [^ 

7724 

5.992 

0.404 


follows word A 

French [sg 

9424 

2.578 

0.478 



Spanish |52| 

12642 

3.570 

0.194 



Japanese [Sg 

3177 

2.613 

0.497 

World Wide 

B has a link to A 

Google web [53] 

15763 

10.861 

0.258 

Web 


nd.edu [54] 

325729 

4.596 

0.557 



Polblogs [Sg 

1490 

12.812 

0.223 


Table SI: Random walk hierarchy of real networks shown in Fig.5. in the 
main paper. The network type is given in the 1®* column, the meaning of 
the links in the 2°*^ column, the references to the data sources are listed in 
the 3^*^ column. The network size is given in the 4*^ column, followed by the 
average degree in the 5*^ column. The hierarchy measure H (calculated at 
the optimal A = 4 parameter value) is provided in the 6**^ column. 

between web pages showed again a moderate hierarchy, where H for the 
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Type 

Network 

H 

(-^rand) i ^(-^rand) 

z 

Electric 

sl488 [36] 

0.893 

0.811 ± 1.66 • lO""^ 

4.94 


sl494 [36] 

0.880 

0.780 ± 1.83 • 10“^ 

5.46 


s5378 [35] 

0.887 

0.803 ± 1.65 • 10-2 

5.09 


s9234 [36] 

0.870 

0.861 ± 1.23 • 10-2 

0.73 


S35932 [36] 

0.719 

0.787 ±5.44 • 10-® 

-12.5 

Citation 

ArXiv-HepPh [37] 

0.352 

0.222 ± 5.16 •lO-'^ 

251.94 


ArXiv-HepTh [37] 

0.430 

0.221 ±2.46 • 10-® 

84.96 

Food web 

GrassLand [3^ 

1.551 

1.036 ±6.20 • 10-2 

8.31 


LittleRock [39] 

0.482 

0.221 ± 1.68 • 10-2 

15.54 


St. Marks [40] 

0.634 

0.425 ± 3.44 • 10-2 

6.08 


Ythan |38| 

0.677 

0.450 ±2.19 • 10-2 

10.37 

Internet 

p2p-l [H] 

0.524 

0.488 ± 1.48 • 10-® 

24.32 


p2p-2 [H] 

0.541 

0.499 ± 1.62 • 10-® 

25.92 


p2p-3 [H] 

0.529 

0.495 ± 1.61 • 10-® 

21.12 

Metabolic 

C. elegans [33] 

0.467 

0.611 ± 1.20 • 10-2 

-12.00 


E. coli [42] 

0.447 

0.630 ±8.99 • 10-® 

-20.36 


S. cerevisiae [42] 

0.440 

0.609 ± 1.14 • 10-2 

-14.82 

Organization 

Consulting [43] 

0.120 

4.99 • 10-2 ± 5.24 • 10-® 

13.38 


Enron [44] 

0.126 

0.160 ± 1.44 • 10-2 

-2.36 


Manufacturing [43] 

2.18 X 10“^ 

2.04 • 10-2 ± 1.27 • 10-® 

1.10 


Freemans-1 [3^ 

6.35 X 10-2 

6.01 • 10-2 ± 1.72 • 10-® 

1.98 


Freemans-2 [^ 

1.06 X 10“^ 

9.42 • 10-^ ± 1.20 • 10-® 

0.98 

Regulatory 

TRN-Yeast-1 [46] 

0.262 

0.264 ±2.39 • 10-® 

-0.84 


TRN-Yeast-2 [47] 

0.720 

0.733 ± 6.45 • 10-® 

-2.02 


TRN-EC [13 

0.823 

0.853 ±8.22 • 10-® 

-3.65 

Trust 

College [4^ 

0.464 

0.403 ± 2.82 • 10-2 

2.16 


Epinions [49] 

0.437 

0.373 ±5.15 • 10-® 

12.43 


Prison [^ 

0.565 

0.565 ±4.94-10-2 

0 


WikiVote [51] 

0.201 

0.157 ± 1.49 • 10-® 

29.53 

Language 

English [52] 

0.404 

0.388 ± 4.89 • 10-® 

3.27 


French [52] 

0.478 

0.519 ±4.64 • 10-® 

-8.83 


Spanish [52] 

0.194 

0.241 ±4.03 • 10-® 

-11.66 


Japanese ^2\ 

0.497 

0.560 ±8.24 • 10-® 

-7.65 

World Wide 

Google web [53] 

0.258 

0.246 ± 1.68 • 10-® 

7.14 


nd.edu [54] 

0.557 

0.451 ± 7.84 • lO-'^ 

135.20 


Polblogs [55] 

0.223 

0.184 ±8.21 • 10-® 

4.75 


Table S2: Comparing H in real networks to that of their link randomised 
counterparts. The network type is given in the 1®* column, the references 
to the data sources are listed in the 2°'^ column, and the H measured in 
the original networks is given in the 3’’*^ column. The average value of H 
in the configuration model, (calculated via link randomisation) is presented 
in the 4**^ column, accompanied by the standard deviation. Finally, the 
corresponding z-scores are listed in the 5*^ column. 
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Google web and Polblogs were quite close to each other, while network of 
nd.edu received a significantly higher hierarchy value. 

It is very interesting to see how the picture is changing when we switch 
from H to the z-score shown in Fig.6 in the main paper, corresponding to the 
difference between H and {H) in the configuration model, scaled by stan¬ 
dard deviation of H in the configuration model. The citation networks, the 
Internet networks and the food webs are forming three rather tight clusters 
in Fig.6. in the main paper with significant positive z-score, (which is out¬ 
standingly high in case of the citation networks). In contrast, the metabolic- 
and regulatory networks are forming clusters of significant negative z-score, 
indicating that their structure is far less hierarchical compared to what we 
would expect by assuming random connections between the nodes, (at the 
same degree distribution as in the original network). The z-scores of WWW 
networks were all positive, (where the nd.edu network showed an outstand¬ 
ingly high value). In case of the trust networks the z-scores are also almost 
always positive, with the exception of the Prison network, where z = 0. The 
organisational-, electric-, and language networks showed a mixed picture, 
including both positive and negative z-scores. 
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